Learning with Randomized Majority Votes
نویسندگان
چکیده
We propose algorithms for producing weighted majority votes that learn by probing the empirical risk of a randomized (uniformly weighted) majority vote—instead of probing the zero-one loss, at some margin level, of the deterministic weighted majority vote as it is often proposed. The learning algorithms minimize a risk bound which is convex in the weights. Our numerical results indicate that learners producing a weighted majority vote based on the empirical risk of the randomized majority vote at some finite margin have no significant advantage over learners that achieve this same task based on the empirical risk at zero margin. We also find that it is sufficient for learners to minimize only the empirical risk of the randomized majority vote at a fixed number of voters without considering explicitly the entropy of the distribution of voters. Finally, our extensive numerical results indicate that the proposed learning algorithms are producing weighted majority votes that generally compare favorably to those produced by AdaBoost.
منابع مشابه
A PAC-Bayes Sample-compression Approach to Kernel Methods
We propose a PAC-Bayes sample compression approach to kernel methods that can accommodate any bounded similarity function and show that the support vector machine (SVM) classifier is a particular case of a more general class of data-dependent classifiers known as majority votes of samplecompressed classifiers. We provide novel risk bounds for these majority votes and learning algorithms that mi...
متن کاملRisk bounds for the majority vote: from a PAC-Bayesian analysis to a learning algorithm
We propose an extensive analysis of the behavior of majority votes in binary classification. In particular, we introduce a risk bound for majority votes, called the C-bound, that takes into account the average quality of the voters and their average disagreement. We also propose an extensive PAC-Bayesian analysis that shows how the C-bound can be estimated from various observations contained in...
متن کاملVoting Based Video Classification Using Clustering and Learning
In this paper, we propose video classification using linear manifolds (affine subspaces). In our method, we represent videos belonging to a same class by several linear manifolds using k-varieties clustering. When a test video is given, each frame of it votes for the class to which its nearest linear manifold belongs. According to this voting, the test video is classified into the class that ac...
متن کاملAn urn model for majority voting in classification ensembles
In this work we analyze the class prediction of parallel randomized ensembles by majority voting as an urn model. For a given test instance, the ensemble can be viewed as an urn of marbles of different colors. A marble represents an individual classifier. Its color represents the class label prediction of the corresponding classifier. The sequential querying of classifiers in the ensemble can b...
متن کاملMinorities and Storable Votes
The paper studies a simple voting system that has the potential to increase the power of minorities without sacrificing aggregate efficiency. Storable votes grant each voter a stock of votes to spend as desired over a series of binary decisions. By accumulating votes on issues that it deems most important, the minority can win occasionally. But because the majority typically can outvote it, the...
متن کامل